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SUMMARY 


We  consider  maltl-etage  proceoaes  Involving  both 
zero— sum  and  non-zero  sum  games.  Using  the  con¬ 
cept  of  "games  of  survival'*  we  derive  approxlraat 
solutions  for  both  classes  of  multi-stage  games 
under  various  realistic  assumptions. 
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^1.  Introduction 

In  the  first  paper  of  this  series,  [0.  Me  conslierel  i  multi¬ 
stage  decision  process  of  simple  and  yet  general  type  inJ  shcwel 
that  an  approximate  solution  of  plauslDle  and  Intultl/e  sort  co 
be  obtained  under  certain  reasonable  aBSumptlons. 

In  tnls  paper,  we  shall  consider  a  more  dll’Mouit  .'lias  of 
problems,  Involving  conflict  between  two  groups.  In  many  situation."! 
of  Importance,  one  group  may  be  considers J  to  oe  tne  Inanimate 
forces  of  nature. 

•  Summarized  ty  the  phrase  "the  perversity  of  Inanimate  cDfejts". 
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When  the  two  groups  are  In  direct  conflict  we  have  the  theory 
ol  zero— sum  games  to  assist  us.  Wfien  tne  two  groups  are  partly 
apposed  and  partly  not,  we  encounter  the  problems  of  non— zero  sum 
games,  where  there  Is  essentially  no  tr»eory  to  guide  us.  Never¬ 
theless,  we  shall  snow  that  under  certain  conditions  we  can  once 
again  obtain  approximate  modes  cf  procedure  which  seem  eminently 
reasonable . 


^2.  Description  of  the  Mul  tl-Stage  r^rocess 

We  shall  consider  a  multi-stage  {recess  cf  the  following  type, 
v^hlcn  we  sfiall  call  a  "game".  There  are  two  protagonists,  whom  we 
cnail  call  "{layers",  named  rather  prosaically  P  and  Q.  Tne  first 
(f  these  {layers,  P,  has  a  choice  of  M  different  plays,  which  we 
shall  designate  by  the  numbers  1,2,...,M,  and  the  second  player,  Q, 
has  a  choice  cf  N  different  piays,  denoted  by  1,2,...,N.  If  P  chooses 
the  1—  wf  fils  c.holces  and  Q  the  J~  of  his  choices,  then  P  receives 
a  quantity  a.  ,  and  Q  a  quantity  b.  These  quantities  may  be  nega- 
tlve,  In  wnlch  case  a  loss  Is  sustained. 

We  can  now  consider  trds  situation  to  repeat  Itself  fc r  a 
I'lnlte  unbounaed  numner  of  stages  and  pose  the  problem  of  deter¬ 
mining  tn^  iptlmal  mode  of  play  for  each  player  under  the  assumption 
trat  efH'h  wls[ies  to  maximize  his  over— all  return. 

d  ■  ^tage 

The  simplest  case,  and  unfortunately  the  ^.nly  case  In  which 
a  Bbllalact  ry  theory  exists.  Is  that  w.here  b^  ,  -  “^IJ’ 
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to  say  the  playera  are  In  direct  opposition  since  one's  gain  Is 
the  other's  loss. 

We  shall  begin  with  the  discussion  of  the  single-stage  process. 
It  Is  clear  that  the  determination  of  optimal  play  Is  trivial  If 
either  of  the  players  is  required  to  move  before  tne  other.  The 
only  interesting  case  la  that  where  both  playera  are  required  to 
move  simultaneously. 

In  these  circumstances  they  protect  themselves  by  mixing 
their  choices.  Les  us  assume  then  that  P  chooses  toe  1—  play 
with  probability  Xj^  and  Q  his  J—  choice  with  probability  yj  . 

The  vector  x  ^  (x ,Xa , . . . ,x^)  specifies  P's  probability  distribu¬ 
tion,  and  y  •  (y i »yf i • • • »yp)  Bpeclfies  Q's  probability  distribution. 

The  expected  return  for  P  will  be 
N 

(1)  Ep(x,y)  -  ^ij^l^j  ' 

and  the  expected  return  for  Q,  EQ(x,y}  wli.1  be  tne  negative  of  this, 

(2)  E^Cx.y)  -  -Ep(x,y) 

The  first  player,  P,  will  choose  his  prob.iblllty  distribution 
X  so  as  to  maximize  Ep  ana  the  second  player  Q  wKl  cnoose  y  so  as 
to  minimize  Ep. 

We  define  two  VtiAues 

(3)  Vp  -  Min  Max  E  (x,y) 

y  X 


and 
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(4)  .  Mjjx  M^n  Ep(x,y) 

In  each  cose  the  variation  Is  .  ver  tne  regions  defined  by 

M 

(3)  (a)  Xj^>0,  ^  -  1, 

(to  yj>o-  yj  ■  >■ 

Tne  first  value,  Vp,  Is  the  expected  return  to  P  If  Q  Is  required 
to  choose  y  before  P  chooses  x,  and  the  second  Is  tne  expected 
return  to  P  If  P  must  choose  x  before  Q  chooses  y. 

It  Is  a  remarkable  fact,  tne  celebrated  mln-max  theorem  ol' 

Von  Neumann,  ”^j  ,  the  basic  theorem  c.f  the  theory  of  games,  that 

(6)  Vp  -  V^. 

The  Inter} retatlon  of  this  result  Is  that  P  can  announce  x  In 
advance,  and  Q  likewise,  without  either  gaining  from  this  advance 
knowledge . 

/h  .  Zero— Sum  Games  -  Finite  Peso urces 

In  many  situations.  Involving  multi-stage  play,  the  above 
model  Is  not  satisfactory.  Tnls  1  .s  particularly  true  In  multi¬ 
stage  prcceuses  where  b  tn  sides  have  finite  resources.  Here  the 
game  auL^mallca i ly  terminates  when  eltner  player  has  no  resources. 


Let  us  fissome  then  that  each  side  now  piays  to  ruin  the  other, 


-  b- 


with  the  game  continuing  until  one  or  the  other  player  la  bank¬ 
rupt.  Let  p  repreaent  the  Initial  amount  poaaeBaed  by  P  and  q  the 
Initial  amount  poaaeaaed  by  Q.  We  define 


(1)  f(p,q)  -  the  probability  that  P  aurvlves  Q  when  P  starts  with 

p,  Q  wl th  q  and  both  aides  use  optimal  play. 

Games  of  tnlo  variety  are  aptly  called  "games  ol'  survival",  cf  , 


Games  of  Lurvlvai  -  Ma t/jernat  1  ja  1  Forrnjiatlon 


Since  P  wins  what  Q  loses,  and  vice  versa,  the  total  quantity 
of  resources  In  the  game  remains  cons  ant,  and  equal  to  n-  q-  ■  N, 

vj  .J 

the  Initial  total,  to  specify  the  state  f  tlie  game  it  Is  sjfJ'lclent 
then  to  state  tne  amount  of  resourcea  possessed  by  P. 

We  replace  !'(p,q)  by  the  function  of  one  variable  f(p).  Let 
uo  now  derive  a  functional  equation  I^r  l'(p).  Enumerating  tlie 
pOBslb.le  outcomes  of  one  stage  of  play,  we  see  that 
N 

( 1 )  f (pj  "  21  ^1^  , )  * 

ITJ-I  ^  " 

wnere  x  -  (x  i  ,Xa,  •  •  ■  .x^j  and  y  -  (yi.y? . y^j)  are  the  [rthablllty 

distributions  ol'  P  and  Q  determining  the  Initial  flay. 

Slrice  P  p'lays  to  maximize  l’(p)  and  Q  to  minimize  f(p),  we 
obtain  the  equatl^-n 

N  _ 

(2)  I  (p )  »  K  ix  Min  y  x^y  .1  (l  J 

X  y 

-  Min  Max  x.y  ,f  (p^a  .  ) 

y  X  frj-i 
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TJ1I3  equcillon  Ic  Vai'.  1  for  0<iKN .  We  al3o  have 

(3)  r(p)  -  1.  r  >  N 

r  ( f  0  “  0 ,  p  <  0 . 

.  Appra.xl ma  te  So  lution 

A  solution  of  (2)  yields  f(pj  and  thus  the  vector  x(p),  y(p), 
the  probaPlllty  distributions  determining  the  Initial  play.  We 
wltn  to  dt'termlne  ap[  rox Ima te  value.s  f  r  x(;.)  <ind  y(p)  under  cer¬ 
tain  asotjnptlono  concerning  the  sequence  of  pay— c.ffs, 

We  assume  that  no  single  play  creates  any  appreciable  change 
In  the  state  of  the  game,  which  Is  to  say  that  a^^  Is  small  com¬ 
pared  to  p. 

We  then  write 


( 1 )  f  j  )  -  f  (i  )  >  '  (p)  . 


Then  {1.2)  takes  tiie  form 

M,N 


(2)  f  (p)  -  Max  Min  '21 

X  y  iTj-i 


M.N 


Ml 

y 


n  M«x  f  ]5  X , 

X  f7j^i  ^ 


or 


(3) 


0  -  Mnx  Min  ^  1' '  (p) 

X  y 

-  Mir.  Max  [  f  '  (p  j 
y  X 


M,N 


?7j.i 

K_j^N 


p-f.9:5 


Lee  as  now  a33a’ne  f.irtner-  tnnt,  ae  Is  trie  In  all  realistic  sltua— 
tlona,  It  definitely  pays  P  to  have  a  larger  Initial  quantity  of 
resources,  that  Is  to  say  f'(p)  >  0  for  all  p. 

T/ien  (3)  Is  replaced  by 

-  m,n 

(4)  0  ^  Max  Min  ^  T”  ^4  iJ^4y  ,  j 

y  rj-i  ‘J  ‘  J 

_  M,N 

^  Min  Max  ^  >  ^4  i^^y  ,  j 

y  X  T7J-1  ^  J 

Tne  meaning  of  this  equation  Is  that  fi  r  large  p,  with  a 
large  number  of  plays  remaining  until  the  end  of  the  game,  the  play 
la  approximately  the  same  as  that  employed  In  tr.e  single-stage  pro¬ 
cess  where  both  players  wish  merely  to  maximize  the  expected  return 
from  one  play. 


This  approximate  solution  has  precisely  the  same  structure 
as  that  given  for  the  one— person  process  In  . 

The  Important  feature  of  (4)  is  tiiat  we  obtain  the  same 
approximate  equation  regardless  of  tne  significance  of  f(p)-  C(,n- 

sequently,  even  In  situations  where  f(p)  Is  not  completely  deter¬ 
mined,  as  frequently  occurs  In  re-allstlr  nlbuhtlcns,  we  kn^w  that 
we  possess  a  go^d  approxlmatl^in  to  optimal  piay. 

Si.  Non— Zero  Sum  G'mes  -  Games  ol  Survival 


Let  us  now  turn  to  a  ulscusslon  of  the  more  general  situation 


where  b.  ,  /  —a.  , . 

^  ^  %J 

of  optima  1  play  In 


Here  ttiere  is  no  tficory  for  tne  determination 
•.  single-stage  process.  C -nsequently ,  we  shall 


turn  Immediately  to  the  dlacusslon  of  multl-atage  proctoaes.  We 
aaaume  once  more  that  both  playera  atrlve  to  ruin  the  other  and 
continue  the  game  until  tnla  occura. 


Let  p  be  the  Initial  amount  poaaeaaed  by  P  and  q  the  amount 
possessed  by  Q,  and  Introduce  the  function  defined  by  (4.1). 


(1) 


This  function  satisfies  the  functional  equation 

M^N 

(a)  f(piq)  -  Max  Min  V  Q  > 

^  y  1 1 J "  ^ 


(b)  f{p,q) 


Min  Max 

y  X 

1, 

0, 

V2, 


M^N 


P»q>0, 


p  >  0,  q  <  0 
p  <  0,  q  >  0 
p  a  q  a  0  . 


.^0.  Approximate  Solution 


Let  ua  assume  that  and  b^^  are  both  negative,  so  that 
we  are  dealing  with  an  attrition  process,  and  that  and  bj^j  are 
small  compared  to  p  and  q. 


We  write 


(1)  P(p-^a^j,  q+b^j)  -  r(p,q)  ^ij^p  ^IJ^q  * 


Then  (7.1a)  yields 


N 


(2)  0  “ir  Max  Min  [  f,^  5“ 

X  y  TTj-l 


M,N 

•  Sj.:  " 


N 


N 


Min  Max 
y  X 


aijx.yj  *  fq  b^jX^yj 
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As  before  we  asBone  mat  >  C,  f^  <  0.  Since  I  a,  ,x,y,, 

P  Q  ij  1  J 


^  ^IJ^l^J  negative.  (2)  yields 


(3)  t 


^  Max  Min  'Y  ^11^1^1^ 

q  X  y  ITj-l  ^  TTJ»1  ^ 


^  Min  Max  ^ 
y  X 


ri  f  n 

rrj-i 


ri  f  I't 

r;j-i 


-I 


The  Interpretation  of  this  equation  Is  that  th  sides  play 
approximately  so  as  to  maximize  or  minimize  respectively  the  ratio 


M,N  M,N 

(")  ^ 


That,  under  the  assumptions  on  a.  ,  and  b,  ,, 

1 J  1  j 


(5)  Max  Min  R(x,y)  ■  Min  Max  R(x,y) 
X  y  y  X 


l8  a  theorem  also  due  to  Von  Neumann,  and  recently  established  In 
a  different  manner  by  Shapley,  JjjJ  • 

^9.  A  Rationale  for  Non-Zero  Sam  Games 


The  Importance  of  the  above  result  resides  in  the  fact  that 
It  furnishes  us  a  motive  for  using  R(x,y)  as  a  universal  pay-off 
function  for  nun— zero  sum  games.  Whether  or  not  It  Is  to  be 
accepted  In  any  particular  situation  will  depeno  on  other  proper¬ 
ties  of  the  game. 


V 
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